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Completeness  Comparisons  Among  Sequences  of  Samples 
II.  Censoring  from  Above  or  Below,  and  General  Censoring* 

by 

IJ.  L.  Johnson 

Department  of  Statistics 
University  of  'forth  Carolina  at  Chapel  Hill 
Chapel  Mill,  North  Carolina  27514 


1 . Introduction 

As  in  [l]  we  will  consider  situations  in  which  we  have  two  sequences 
of  samples  A^  = A2  = ^21^22' respectively.  Wtej  denote 
the  order  statistics  in  A^  by 


Xijl  5 Xij2 


5 Aijr 


( r^  is  the  number  of  observed  values  in  Ajj  ) . 
For  convenience  we  will  also  use  the  notation 


Ail  '\L2 


for  order  statistics  in  a typical  member  of  A.  (i  » 1,2).  \ 

We  further  suppose  that  it  is  hnown  that  each  sample  in  one  of  the  two 
sequences  (it  is  not  specified  which  one)  is  a complete  random  sample  while 
the  other  is  a random  sample  which  has  been  censored  by  removal  of  some 
extreme  observations.  We  will  denote  the  original  sample  or  sequence  by 
placing  a bar  over  the  corresponding  symbol  — thus  hi  would  belong  to  ^ 


* This  research  was  supported  by  the  U.  S.  Army  Research  Office  under 
Contract  No.  DAft  G29-74-C-0050. 
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In  [1]  it  was  supposed  that  symmetrical  censoring  (removal  of  the  s 
greatest  and  s least  values  from  an  original  complete  random  sample  of 
size  (r+2s)  ) had  been  used  on  one  sequence.  In  the  present  paper  we  will 
consider  censoring  from  above  or  below  — for  examples  removal  of  the 
greatest  s or  the  least  s (but  not  both)  from  an  original  complete 
random  sample  of  size  (r+s). 

';Je  will  also  consider  a natural  extension  of  a general  purpose  test  of 
extreme  sample  censoring  (suggested  in  [2]  for  use  when  the  proportions  of 
greatest  and  least  values  reroved  are  unknown) . 


For  definiteness  we  will  discuss  censoring  from  above,  in  which  the  s 
greatest  values  are  removed  from  a complete  random  sample  of  size  (r+s)  — 
or  more  generally  (r—  + s). 

The  cumulative  distribution  function,  (cdf)  F^(x),  will  be  supposed 
to  be  the  same  for  each  of  the  i-th  pair  of  sarples  A^  and  • It 
will  be  assumed  to  be  absolutely  continuous,  with  probability  density 
function  f^(x)  = dF^/dx. 

•ie  will  also  use  F(x)  for  the  cdf  in  a general  pair  of  samples  for 
Aj^  and  A^  for  convenience. 


2.1  Population  Dlstrlbutlon(s)  Knov'n 


In  this  case  the  ratio  for  the  hypotheses  vs.  K. ^ where 


fh  = sequence  A^  is  censored 


3 


is 


m 


ljl2  = n 

1 j=i 


“(rii+l)ts] 

1-F(X,.  ) 2 

I3,rij 

i-F(Vr  ) _ 
zi  ' 21 

(l) 


(remember  that  censoring  is  from  above) . 

IThatever  the  value  of  s,  we  see  that  discrimination  between  and 


H2  will  be  based  on  the  statistic 


m , m 

t = n {i-F(x, • ) }{i-F(x7 • )fA  = n (zli  /z2i  ) 

j=l  l3’rlj  ^’r2j  j=l  i:,’rlj  ZJ’r2j 


(2) 


where  Z- - =1  - F(X--  _).  Generally  H, (H7)  is  accented  if  T >(<)  K. 

If  r^  = r2j  for  all  j the  likelihood  ratio  is  just  Ts  and  it  is 
natural  to  take  K=1  and  use  the  rule 


"Accept  if  T>1. 
Accept  H2  if  T<1." 


(3) 


(If  1=1,  no  decision  is  reached.) 

If  H.  is  valid  Z. . „ has  a standard  beta  distribution  with 
1 ^'rlj 


narameterr,  (s+l),r...  and  Z7  • , independent  of  Z..  , lias  a 

J-j  ,r2j  lj 

standard  beta  distribution  with  parameters  l,r2j  . 

Each  of  the  ratios  V.  = Z, • /Z7.  _ is  distributed  as  the  ratio 

3 -U  »r ^3  >r2j 

of  independent  standard  beta  variables  with  parameters  (s+1) ,r,  j and 

1 ,r7-  respectively.  Hie  V. !s  are  mutually  Independent  and  we  have 

J 

R1 

Pr[correct  decision! H^]  = Pr[  n > iC]. 

When  m»l  and  r^  * r2  * r we  have 


<*>  •> 

*•  >— 1 « i,; 


' .1- 

littiTlfrni 
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Pr [correct  decision |H^]  = Pr[V^  > 1] 


- (s*rjlr!  f1 
s! { (r-1)  ! t JO 

_ (s+r)!r!  f1 

sK(r-l)!}2  JO 


[ 2^1-Z1)I"1(l-Z2)r_1dz2dz1 

z^l-z^1"1  • r'i{l-(l-z1)r}dz1 


= . i[B(s+l,r)  - B(s+l,2r)] 

s! ( (r-1) ! t r 


_ (s+r)  !r ! 

s!{(r-l)!T 


ir  s!  (r-1) ! _ si  (2r-l)  1 
r _ (s+r)T  (s+2r) ! _ 


(4) 


This  is,  of  course,  also  the  probability  of  correct  decision  when  H2  is 
valid.  Sone  numerical  values  are  given  in  Table  1. 


TABLE  1 

Probability  of  Correct  Decision, 

Based  on  Single  Pair  of  Samples,  Each  of  Size  r 


s/r 

5 

10 

15 

20 

25 

50 

CO 

1 

0.727 

0.733 

0.742 

0.744 

0.745 

0.748 

0.750 

2 

0.841 

0.857 

0.363 

0.366 

0.868 

0.871 

0.875 

3 

0.902 

0.919 

0.92S 

0.928 

0.930 

0.934 

0.9375 

4 

0.937 

0.953 

0.958 

0.961 

0.962 

0.966 

0.969 

5 

0.958 

0.972 

0.976 

0.978 

0.980 

0.982 

0.984 

The  probabilities  in 

the  above  tables  are, 

in  fact. 

simply 

values  of 

pnxlr  < 

*2rl[y 

(4) ' 

since  (V^  > 1)  is  equivalent  to  (X^r  < X^) . 

For  sequences  A2  of  m samples,  when  m is  large  enough,  we  nay 
use  a normal  approximation  to  the  distribution  of 


t*  i 


m 

T'  = l Jin  V.  . 

j=l  3 

Using  the  methods  described  in  Section  2.2  of  [1]  we  obtain 

m 


•CtCT'l^)  = ^ [{♦*-» (s.1)  - 'Plt_AJ(rlj+s+l)} 
+ - ^Ct‘1)(r2i+1)}] 


(t-1), 


m rlj 


(5) 


= (-l)t(t-2)!  I [ l (h+sy^'V  + (-l)*  l h"(t-1)]. 
j=l  h=l  h=l 


If  = r for  all  j = l,2,...n,  then 


r r 

c.CT’lHJ  = n • (t-2) ! [ £ h'(t_1)  + (-l)1  J (h+sf (t>1)]. 
r 1 h=l  h=l 


(6) 


2.2  Population  Distribution  Mot  Known 

Using  the  method  described  in  section  3 of  [1]  we  suppose  that  among 
Tp  r2  observed  values  from  samples  in  sequences  Ap  A2  respectively 
(with  a common  distribution)  tiie  G2  greatest  are  from  A2  . Under  , 
there  were  originally  (r^  + s)  values  in  a complete  random  sample  A^ 
from  which  the  greatest  s have  been  removed  to  form  A^  , while  the  r2 
values  from  A2  represent  a complete  random  sample.  The  number  of  order- 
ings of  the  original  (r^  + r2  + s)  values  which  would  produce  an  ordering 
witli  tne  G2  greatest  values  from  A2  , after  censoring  in  the  way  described 
is 

V* 

s 

and  the  corresponding  likelihood  is 


1 

1 


* 

I 

I 


i 
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4^4  = UG2n)Cs]/(G1+l)[s;:‘}  * {(Vl)[s;i/(r2+l)W}.  (7) 

If  Tj  = r7  = r then  the  likelihood  ratio  is  just 

(G2+l)[s-i/(G1+l)[s:i.  (8) 

Since  one  of  G^  and  G2  must  be  zero  (and  the  other  positive)  we  see 
that  the  likelihood  ratio  approach  leads  to  the  decision  rule 


"Accept  IT  if  Gi  > 0 (i  = 1,2)." 


(9) 


Since  Gj  > 0 is  equivalent  to  jLlr  > X2r  this  rule  is  equivalent  to  the 
rule  (3)  obtained  from  the  approach  based  on  knowledge  of  the  population 
distribution. 

The  probabilities  of  correct  decision  given  in  Table  1 thus  apply,  and 
we  see  that  in  this  case  we  appear  to  lose  nothing  by  not  knowing  the 
population  distribution. 

The  situation  is  different  when  we  have  more  than  one  sample  in  each 
sequence  (n>l).  If  we  suppose  r.  • = r for  all  i and  all  j,  we  are  led 
to  the  rule 


n 


"Accept  II,  if  H 
1 j-1 


^1) 


[si 


1ST 


> 1. 


.CGlj+l)' 

.Accept  .12  if  the  ratio  is  less  than  1." 


(10) 


(If  the  ratio  equals  ls  no  decision  is  reached.)  The  pairs  (Gjj>G2j) 
for  j = l,...m  are  mutually  independent,  and  the  joint  distribution  of 
G^  and  G2j  if  is  valid  is 


7 


Pr[(G1j=g1)n(G2j=0)  l^]  = 
Pr[(Glj=0)n(G2j=g2)  !!,]  = 


2r-g.,-l 


J1 
r-1 


(2r+s) 
r 


P,2+s 

s 


f2r-p0-l) 


"g2‘ 

r-1 


f 2r+sl 
s 


(11) 


(fi^>S2  - 1»2». . • »r). 

The  possible  values  of  the  ratio 


Wj  = (G2j  + l)Cs]/(Glj  + l)[s] 


i si  “l 

are,  in  increasing  order,  s!{(g+l)L  J}  for  g = r,  r-1,  ...  1 and 
(s!) ’1(g+l)LS^  for  g = 1,2,. . .r. 

For  snail  values  of  n.  and  r it  is  possible  to  evaluate 

n. 

?r[correct  decision]  = Pr[correct  decision|fL]  = Pr[  II  I‘J.  > 1|H,].  (12) 

3=1  J 

For  example  if  m=2,  we  have 


Prfcorrect  decision]  = ^[correct  decision  on  each  of  ’^2p 

and  (A^2,A2,)  separately] 

+ Fr[one  correct  and  one  incorrect  decision] 
x PrD'lj*^  > l|Hj_  > 1;  W2  < 1] 


■ {■  ■ mm  • !(”'f 1 „s  j 

°1  °2 


R.,<r 


g2+s 

s 


’2r-g2-l 

2r-g1*l 

r-1  . 

. r-l  . 

(using  (4)  and  (11)  ).  (13) 


The  double  summation  is  conveniently  evaluated  as 

h'1  fZr-B.-ll 

I 


r 

l 

e2=2 


g2+s 

s 


2r-g2-l 


r-1 


E1 


=1  ^ 


'Ki- 

r-1 


(14) 
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With  m>l  there  is  the  possibility  of  no  decision  being  reached.  For 


n=2,  the  probability  that  no  decision  is  reached  is 


2 


-2 


fs+sl  l'2r-g-l'l2 

l»Jl  r-lj  • 


(15) 


TABLE  2 

Probabilities  of  Correct  Decision  and  No  Decision  when  m=2 


s/r 

5 

10 

15 

Correct 

No 

Correct 

No 

Correct 

No 

1 

0.692 

0.068 

0.717 

0.058 

0.726 

0.055 

2 

0.842 

0.039 

0.370 

0.030 

0.880 

0.027 

3 

0.915 

0.023 

0.940 

0.015 

0.946 

0.012 

4 

0.953 

0.013 

0.972 

0.007 

0.977 

0.005 

S 

0.972 

0.003 

0.986 

0.004 

0.990 

0.003 

3.  j 

General  Censoring  of  Extremes 

If  we  do  not 

know  the  numbers  s',  s" 

of  least  and  greatest  observa 

tions  removed  from  one  of  two  samples  (or  from  each  sample  in  one  of  two 
sequences)  — though  we  do  know  that  such  removal  has  taken  place  from  just 
one  of  them,  we  cannot  use  the  direct  likelihood  ratio  approach.  Results 
obtained  in  [2]  suggest  that  we  might  use  criteria  based  on  quantities 
D — = G—  + L—  where  G — (L  — ) is  equal  to  the  number  of  values  among 
the  sample  A—  which  are  greater  (less)  than  any  values  in  A,  . . . 

If  we  simply  Iiave  one  pair  of  samples  (n=l)  of  equal  size  r,  then 
we  would: 

“Accept  if  G^  + < G2  + 1^ 

Accept  if  > G2  + L2 

Reach  no  decision  if  = G2  + 


' 
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This  is  the  sane  test  as  that  obtained  in  [1]  (see  (30'),  page  13)  for 
the  case  when  the  censoring  is  known  to  be  symmetrical. 

If  m is  greater  than  1,  the  value  of  D^j  might  be  combined  by 
either  multiplication  or  summation,  leading  to  the  rules: 


or 


m m 

"Accept  IL(H?)  if  n (D..  + 1)  <(>)  n (D?.  + 1)" 
1 Z j-1  j-1 

m n 

Accept  H,  (H7)  if  n D, . <(>)  £ D7. 
i i j=1  13  j-l  ^ 


(16) 

(17) 


In  both  (16)  and  (17) , no  decision  is  reached  when  there  is  equality. 
Neither  of  these  rules  is  the  same  as  that  suggested  for  symmetrical 
censoring  in  [1]  (foot  of  page  12). 

An  approximate  assessment  of  tile  properties  of  procedure  (17)  can  be 
based  on  normal  approximation  to  the  distribution  of 


\ - 11  <cij  - v 

The  first  and  second  moments  of  (D-^  - D2j)  under  are  derived  in 
the  Appendix  (equations  (A12)  and  (A27)  ).  Numerical  values  for  the 
expected  value  and  variance  of  (D^  - ) under  are  given  in  Table 

3.  The  variance  under  H2  is  the  sane  as  under  , while  the  expected 
value  has  its  sign  reversed  (positive  under  JI2  , negative  under  ). 
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TABLE  3 


foments  of  (D-j-D-,) 


X 

s' 

s" 

5[D1-D2|H1] 

var(D1-D2jH1) 

s.d.  (D^D^ 

Hx) 

n* 

m** 

15 

2 

2 

-3.852 

8.993 

3.000 

4 

6 

3 

1 

-3.709 

9.025 

3.014 

4 

7 

4 

0 

-5.254 

8.912 

2.935 

5 

9 

2 

1 

-3.073 

9.323 

3.053 

6 

10 

T 

J 

0 

-2.632 

9.010 

3.002 

7 

12 

1 

1 

-2.271 

9 • 56c> 

3.093 

11 

13 

2 

0 

-2.014 

9.177 

3.029 

13 

22 

1 

0 

-1.163 

9.340 

3.056 

33 

66 

0 

0 

0 

9.015 

3.002 

- 

- 

10 

2 

1 

-2.647 

7.589 

2.755 

6 

11 

3 

0 

-2.243 

7.203 

2.685 

8 

14 

1 

1 

-1.936 

7.953 

2.321 

11 

20 

2 

0 

-1.726 

7.546 

2.745 

14 

25 

1 

0 

-1.024 

7,899 

2.810 

41 

72 

0 

0 

0 

7.313 

2.796 

- 

- 

5 

1 

1 

-1.336 

4.750 

2.179 

15 

26 

2 

0 

-1.059 

4.339 

2.033 

21 

37 

1 

0 

-0.907 

4.229 

2.057 

23 

50 

0 

0 

0 

5.063 

2.250 

- 

- 

For  m modestly  large  — say  in  s 4 — the  distribution  of 

m 

m j=l  13  2} 

vail  be  approximately  normal,  and  the  probability  of  correct  decision, 
given  either  or  Ii7  , v;ill  be  approximately 

4>(/j|E[D1-D2|M1j  /{var(D1-D2|H1)}}s).  (18) 

In  the  last  two  columns  of  Table  3 m*  gives  the  minimum  integer  value  of 
m for  which  the  approximate  probability  of  correct  decision  is  at  least 
0.99,  ie 

pf n It; 


’ varCDj^-D^I^) 


;>  2.326 


11 


and  r.i**  gives  the  minimum  value  for  which  the  approximate  probability  of 
correct  decision  is  at  least  0.999. 

As  r**>,  ( s'  and  s”  remaining  constant)  the  limiting  values  of 

expected  value  and  variance  are 


lim  EtD.-D^Ih]  = -(s’+s'  +2)  + 2's’  + 2_s' 


(20) 


and 


- Q * 


S1  .-s' 


lira  var(D, -D?|H,)  = 2(s’+s:,+2)  + (2sr+s)2  3 + (2s!+s)s  3 - 4 3 - 4 

(21) 

O'.'e  use 

= 2 C lin{(2^)/(2^c)>  = 1» 

XMoo 

lta{Q/t2^)}  - 0.  ) 

Table  4 gives  some  numerical  values  of  these  limits  and  corresponding  values 
of  m*  and  m**.  From  Table  4 it  appears  that  the  limiting  values  are  not 
closely  approached  unless  r is  about  50  or  more. 


TA3LE  4 

Limiting  Expected  Value  and  Variance  as  r-*»  (s',  s"  constant) 


2 

3 

4 
2 
3 
1 
2 
1 
0 


s" 

2 

1 

0 

1 

0 

1 

0 

0 

0 


lim  E[D 

-5.5 

-5.375 

-4.984375 

-4.25 

-3.875 

-3.0 

-2.75 

-1.5 

0.0 


lin  var(D, -D2|H^) 

16.375 

16.609375 

16.3046875 

15.484375 

15.359375 

14.50 

14.1875 

13.25 

12.0 


ra*  m 


3 

4 

4 

5 

6 
9 


6 

6 

7 

9 

10 

16 


11  18 
32  57 


• v7"T.  •*'  t.  "-  — « 

t . • ; : ’ •>*•:.«  i* 

'*  1 
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The  decision  rule  (16)  is  just  as  simple  to  apply  as  (17),  but  the 
distribution  theory  associated  with  it  is  rather  more  complex. 

We  can  examine  the  relationship  between  (16)  and  (17)  in  a simple  case 
(r=2,  s*=l,  s"=0)  by  direct  calculation  of  probabilities.  In  this  case 
there  are  (^)  = 10  possible  configurations  of  the  combined  sample 


in  two  = 0 and  D2  = 2,  and  in 


n f<2  . In  seven  of  these  = D2 
one  = 2 and  D2  = 0. 

Now  (16)  and  (17)  each  lead  to  acceptance  of  (which  is  valid 
since  s'>0  ) when  there  are  more  pairs  of  samples  with  one  of  the  two 
(D^  = 0,  D2  = 2)  configurations  than  there  are  with  the  single  (D^  = 2, 

D2  = 0)  configuration.  There  will  be  no  decision  when  the  number  of  pairs 
of  samples  is  the  same  for  each  of  the  two  types  of  configuration. 

The  probability  of  correct  decision  is 

l l cj)  (f)h(|)j'h 

j=i  j 10  10  h-CHjJ+i 

= (0.7)n  l (®)7‘j  f (j)2h 
j-1  1 h-CVJ3+l  11 

where  [hi ] denotes  the  integer  part  of  hj • 

The  probability  of  no  decision  is 


(22) 


Clin] 


- (0.7)lil(m!)  I ' {(n-2j)h)*1(j!)'2(^)j  . 
j=0 

(Of  course,  we  take  (^)  =1.)  Alternatively,  from  the  point  of  view  of 
procedure  (17),  we  need  the  distribution  of  T^  * (Djj-I^j).  The 


(25) 
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distribution  of  is 

PrCDjj-Dy  = d|!!,]  = 

and  the  probability  generating  function 

1 ^*■2  . . .2-.  1 " 2 ^ « ~ 2 . 4v 

r<j(2t  +7  + t)=j>j-t  (2  + 7t  + t). 

The  probability  generating  function  of  T is  10"mt"2n(2  + 7t2  + t4)r‘  . 
Hence  the  probability  of  correct  decision  when  is  valid,  using  (17),  is 

PrCljjj  < 0 |H^]  * 10  n x (sums  of  coefficients  in  terns  of  ta 

with  a<2n  in  (2+7t2+t4)m  ) . (24) 

The  probability  of  no  decision,  when  using  (17)  is 

Pr[Tm  ® 0|Hj]  = 10'n(coefficient  of  t2n  in  (2+7t2n4)m  ).  (25) 

Table  5 contains  a few  numerical  values.  The  values  given  by  (22)  and 
(24),  and  by  (23)  and  (25)  are  of  course  identical. 

In  general,  procedures  (16)  and  (17)  will  not  be  identical,  but  this 
example  is  of  interest  in  showing  that  they  can  be. 


r for  d=-2 


TJ 

1 

Iff 

is 


for  d=0 
for  d=2 


TA3LE  5 

Decision  Probabilities  when  H,  (or  H,)  is  Valid 
Under  Either  Rules  (16)  or  (17)  when  r*2,  s'®!,  s"*0 


m 

Correct 

No 

2 

0.32 

0.53 

4 

0.45 

0.36 

6 

0.53 

0.28 
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APPE  .DIX 


foments  of 


(i)  V'e  will  use  the  convention  (*)  = 0 if  a<b. 
(ii)  V»'e  will  use  the  relationships  (a<b) 

j <j)  - O - (r?x) 

j=a 

r ,a+j.  fb+r-j->  _ ,a+b+i+r.  ,a+b+l+r.. 

. n 1 a n b ' “ 1 a+b+1  J ~ 1 r 


(Al) 

(A2) 


For  convenience  we  will  omit  the  subscript  denoting  order  in  the  series 
ar»d  consider 


D1  ‘ °2  = L1  + G1  ’ L2  G2 


where  L^(fb ) is  the  number  of  values  in  less (greater)  than  any 
values  in  A^  ^ . 

V'e  introduce  indicator  variables 


y _ /l  if  the  h-th  order  statistic  of  A.  is  less  than  any  value  in  A,  • 
L ih  (0  otherwise  1 

and  , defined  as  jX-j;n  but  with  'less'  replaced  by  'greater'. 

Then 


and 


ii  lXi-‘  ■ 0i=jiGXih 


(A3) 


E[Lx?h]  ■ PnLxih  - 13;  el^]  - PrC^  - i] 


(a  - 1,2) 


EVih  LXilc3  " ECLXik!  ^ k,h 

E[GXih  (?i'J  " E^ih3  £or  k2h- 


(4.1) 

(4.2) 


. * 


' - ir  ii  *ifr ir --1- 


m 


IS 


Under  H,  , the  total  number  of  ways  of  arranging  the  original 
(r+s'+s'O  values  in  and  the  r values  of  A2  in  ascending  order  of 

2r+c»+c'' 

magnitude  is  ( r ).  Consequently 

lumber  of  arrangements  for  which  jX^l 

E^LXilJHl^  " ,2r+s;+s‘\ 

r ’ 

and  siiailarly  for  expected  values  of  otner  quantities.  It  is  convenient  to 
introduce  the  symbols 


■?* 


CMHX3  - C2r+Sy+S")E['  |hx] 


CA5) 


in  our  calculations. 

Vfe  will  use  pictorial  representation  to  indicate  the  derivation  of  the 
various  combinatorial  formulae. 


EVihi^  - c2rT  'h> 


s’+h 

r+s"-h 

a2  0 

r 

(A6) 


F*r  v lr  1 - SY  fu+h-lw2r+s'+s''-u-h. 
E ‘■L'i2h'I,l-!  - 1 n-1  n r-h  J 


(A7) 


From  (A3)  and  (A6) 


-*i-t  i-ji.  r f2r+s'  -h,  _ ,2r+s'\  _ fr+s'', 
l ( r ) ~ ( ) '•r+1  ' 

h=l 


(using  (Al)  ). 


(A8) 


Similarly 


- (2£f  ) - £f). 


(AS) 


From  (A3)  and  (A7) 


, •■'fi  ■' 

. . • ' '•  * 

• •>  • «> 
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s'  r 


■ je  £ 

= I (2r+J_iS  ) (usinji  (A2)  ) 
u=0  r 1 

- (s-dc2^:!5'). 


Similarly 


:*[G2|Hj]  > (S  .l)(2r*^s") 


(A10) 

(All) 


From  (A3)  - (All) 


(2i-+s  1 , + , 2r+s' ' _ fr+s  * , _ , r+s'  \ , , „ „ , 2r+s ' +s'\ 

E[D.-D-|H.]  « ---1*1  r+1,  V1  } ( K III  } 

1 21  1J  ,2r+s'+s;'\ 

^ r -1 

We  now  turn  to  the  evaluation  of 


(A12) 


e[(d1-d2)2|h1]  = ECL2!^]  ♦ ECG2!^]  ♦ eCl2)^]  + eLg2^]  ♦ 2e[l1g1|h1] 

+ 2E[L2G2|i!1]  - 2E[L1G2|H1]  - 2E[L2G1|H1].  (A13) 

(Mote  that  L^L2  are  G^C-2  are  always  zero.) 

In  view  of  (A4.1) 

r 7 r-1  r 

V4* 


Nov/ 


E,[LIIV  - ^ L ih  * 2 l I LXih  lXJX 

• E‘ai  * 2 j,  »-«  lW- 


(A14) 


E*U,  fc-y  »-i)(2r*fk) 

i.-i  ]<=1  1 


r-2  j 

.i  i (r+s;+y) 

Jx0  n«0  r 

r-2 


x «r*s;:r>  - er» 


j*0 


r+1 


(cont . ) 


' •-r-zrr  **  r»* 

. . '*•  .*.<  - ' 
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= f2r+S’')  - ( 
1 r+2  ' l 


r+S'!+1)  - (r-l)£f) 


r+2 


_ ,2r+s‘\  . fr+s\  _ fr+s!\ 
" ( r+2  3 lr+2  J “ lr+l  J 


and  so 


-,*ri2it,  i _ r2r+s'\  fr+s‘S  + ?rr2r+s'S  - fr+s'S  - rfr+s,'l} 
- ( r+2  ) Cr+i  J r+2  ' ^r+2  •*  ^r+1  ' 

,2r+s‘\  . ,,2r+s’i  ,9_+1  > rr+s S . 2rr+s''l 

" ( r+l  3 2(  r+2  3 ' (2r+13(r+l  3 ZCr+2  3* 


(A15) 


Similarly 


Also 


EtC^Hj]  - (2£f)  ♦ 2(2™')  - (2r*l)<£i')  - 2(«'). 

e*cI  (k-D  ^ihj]  - i ft-«si  (Ug;1)(2r*s'^'u'11) 

k=l  L 1 k=l  u=0  ' 


(A16) 


E*fLxii>  Aki-v  ■ (r*k;h'1)- 
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(A20) 


s'+h|k-h-l  s ' +r-k+l 


A.,  0 


From  this 


r-1  r 


e*Cl1c1|h1]  = l l ( 

1 1 1 h*l  k=h+l 


r+k-h-1. 


= y (2r-^  = f2r  . 
I >.r+1  J (.r+2J 


(A21) 


(using  (Al)  twice). 


Also,  with  k>h  agAin 


u=0  v=0 


'A  y Ih  1 - Ai+h-1-.  rr+s'+s"-u-v+k-h-l.  ,v+r-k% 

“ lLA2h  gW  1 J ( h-1  X k-h-1  )(r-k>' 


CA22) 


Aj  uss*  r+s'+s  '-u-v  vss" 


A7  h-1  x k-h-1  *r-k 


Hence 


% V rv*T~K  /-u+h-1^  .-r+s'+s’-u-v+k-h-l-* 


e*[l7gj!l]  = y y y (v+r:-)  ^-iww+s-^-ii-L 

2 2 1 u-0  v=0  k=2  r_k  h-1  u r+s'+s  -u-v  ' 


•iii  rr-hrs'T^ 

u=0  v-0  k=2  v k'2 


■ i i (2r*!:;s’') 


(using  (A2)) 


u=0  v=0 


= (s'+l)  (s"+l)  (2r+^2S'  ) ' 


(using  (A2)  again) 


(A23) 


’•  Vi.  'T>-T 

. *•  • 


ii.UiUIIPIR 


To  calculate  Ii*[L1G2 |H^]  we  need  (for  amj_  h,k  = l,...r) 
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EVih  0*2*1 V ■ l c V H 

V='J 


v+r-lu  ,r+s!,-v-h+k-l> 


r+s"-v-h 


^2  s'+h  r+s'-v-h  vss" 
A2  0 k-1  r-k 


wiience 


s'1  r r 


e*..l,g7|h,]  ‘lit  (v*I"!lHr*s’,Tht'1) 

121  v=rj  hil  k=l  v r+s  -v-h  1 


- 1 i (2r;?;Mi) 

v=0  h*l  r 1 


Similarly 


- i c2r;s")  - (r;s,i) 

v=0  r r 

- (s"+l){(2r*si')  - (r*S'')}. 


= (s'+i){(2r;s’)  - cr;s’)}. 


(using  (A2)) 


(using  (Al)) 


Combining  (A13,  15,  16,  13,  19,  21,  23,  25,  26)  we  obtain  after  some 


rearrangement 


ECayD/lHj]  . (2r*s'*s")-la,{2|.2r*s)  , (2r*s,  . 2(s,1K2r*5, 

- Z(£f)  - C2r»l)Q)  * 2(s*l)(r*5)) 

* (s'+s"+2)(2r+*’jSi , 4 (s'.s'42)C3'4s  '45)C2r^'*s”) 

* (A27) 


where  £'  denotes  summation  over  s--s' , s=s". 


20 


As  a check  we  note  that  for  r=l , when  s ♦ G2  = 1 and  as 

Ej  - D,  « 0 identically,  (A12)  gives  E(D^-D2|H^]  = 0 and  (A27)  gives 
var (0^-32!^)  » 0. 

Of  course,  if  in  a sequence  of  sanple  pairs  each  of  size  1 we  had  a 
preponderance  of  pairs  with  = 1 =*  G2  we  would  suspect  that  one  of  the 
two  sanples  had  been  censored  unsymmetrically,  but  we  would  not  be  able  to 
decide  which  one. 

If  s'  = s'*  = 0 then,  as  we  would  expect,  from  (A12) 


ES-Di'E^I^  = 0 (for  any  r) 


and  from  (A27) 


varOj-DjIHp  - Afr1  * $£0^  ■ 
Distribution  of  (L-j,  G2) 


(A28) 


The  joint  distribution  of  L^,  Gp  L0 , G2  when  Hj,  is  valid  is  given 
by  tne  following  table. 

h 81  H s2  (2rf5^s,')pt[^(i.-t1}Ji(o1*gi)] 


so  >0  0 0 

■V 

>0  0 0 >0 

0 >0  >0  0 

0 0 >0  >0 


2r-2-£1-g1 

r-2 


’2r-2-il1-g2' 

s’'+g2 

; r"ll_1  ; 

0 

<=" 
L 5 

'2r-2-£2-gi 

s'«2 

• r'sl'1  - 

s* 

’2r-2-(L2-g2' 

s'^2 

’s"*%2 

. 'r"2  . 

0 

s’ 

0 

s" 

- •*- 
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Calculation  of  the  moments  of  (D.,^)  from  the  joint  distribution 
of  7_^  and  r>2  does  not  seem  to  be  as  convenient  as  calculation  using 
indicator  variables.  This  method  is  easily  extended  to  the  case  when  r^ 
and  T2  y the  sample  sizes  in  Ap  A2  respectively,  are  not  necessarily 
equal.  However,  the  appropriate  test  criteria  would  then  be  different, 
and  so  the  results  will  not  be  given  here. 

Tuhey  [3]  uses  a test  statistic  (for  differences  between  two  samples) 
which  in  our  notation,  would  be  defined  as:  — if  ^ * 0,  the  value  of 

the  statistic  is  (L^  + G^).  (If  L^G-,  = = 0,  no  value  is  assigned 

to  the  statistic.)  Tne  reader  may  find  the  derivation  of  the  distribution 
of  the  statistic  in  [3]  of  interest. 
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